Relevance Score of Triplets Using Knowledge Graph Embedding - The Pigweed Triple Scorer at WSDM Cup 2017
نویسندگان
چکیده
Collaborative Knowledge Bases such as Freebase [1] and Wikidata [2] mention multiple professions and nationalities for a particular entity. The goal of the WSDM Cup 2017 [3] Triplet Scoring Challenge was to calculate relevance scores between an entity and its professions/nationalities. Such scores are a fundamental ingredient when ranking results in entity search. This paper proposes a novel approach to ensemble an advanced Knowledge Graph Embedding Model with a simple bag-of-words model. The former deals with hidden pragmatics and deep semantics whereas the latter handles text-based retrieval and low-level semantics. 1. TASK INTRODUCTION Many entities usually have multiple professions or nationalities, and it is often desirable to rank the relevance of these individual triplets. The goal of the challenge was to compute a score in the range [0, 7] that measures the relevance of the statement expressed by the individual triplet compared to other triplets from the same relation. Participants were provided with a list of 385,426 entities along with five files, • profession.kb: all professions for a set of 343,329 entities • nationality.kb: all nationalities for a set of 301,590 entities • profession.train: relevance scores for 515 tuples (pertaining to 134 entities) from profession.kb • nationality.kb: relevance scores for 162 tuples (pertaining to 77 entities) from nationality.kb • nationality.kb: 33,159,353 sentences from Wikipedia with annotations of the 385,426 entities Apart from these, the participants were allowed to use any kind or amount of additional data (except for human/judgements). The output of this task was to generate relevance scores for all the triplets, 0 being the lowest relevance, and 7 being the highest.
منابع مشابه
Predicting Relevance Scores for Triples from Type-Like Relations using Neural Embedding - The Cabbage Triple Scorer at WSDM Cup 2017
The WSDM Cup 2017 Triple scoring challenge is aimed at calculating and assigning relevance scores for triples from type-like relations. Such scores are a fundamental ingredient for ranking results in entity search. In this paper, we propose a method that uses neural embedding techniques to accurately calculate an entity score for a triple based on its nearest neighbor. We strive to develop a ne...
متن کاملIntegrating Knowledge from Latent and Explicit Features for Triple Scoring - Team Radicchio's Triple Scorer at WSDM Cup 2017
The objective of the triple scoring task in WSDM Cup 2017 is to compute relevance scores for knowledge-base triples of typelike relations. For example, consider Julius Caesar who has had various professions, including Politician and Author. For two given triples (Julius Caesar, profession, Politician) and (Julius Caesar, profession, Author), the former triple is likely to have a higher relevanc...
متن کاملTriple Scoring Using a Hybrid Fact Validation Approach - The Catsear Triple Scorer at WSDM Cup 2017
With the continuous increase of data daily published in knowledge bases across the Web, one of the main issues is regarding information relevance. In most knowledge bases, a triple (i.e., a statement composed by subject, predicate, and object) can be only true or false. However, triples can be assigned a score to have information sorted by relevance. In this work, we describe the participation ...
متن کاملTriple Scoring Using Paragraph Vector - The Gailan Triple Scorer at WSDM Cup 2017
In this paper we describe our solution to the WSDM Cup 2017 Triple Scoring task. Our approach generates a relevance score based on the textual description of the triple’s subject and value (Object). It measures how similar (related) the text description of the subject is to the text description of its values. The generated similarity score can then be used to rank the multiple values associated...
متن کاملPredicting Triple Scoring with Crowdsourcing-specific Features - The fiddlehead Triple Scorer at WSDM Cup 2017
The Triple Scoring Task at the WSDM Cup 2017 involves the prediction of the relevance scores between persons and professions/nationalities. The ground truth of the relevance scores was obtained by counting the vote of seven crowdworkers. I confirmed that features related to task difficulty correlate with the discrepancy among crowdworkers’ judgement. This means such features are useful for pred...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1712.08353 شماره
صفحات -
تاریخ انتشار 2017